1. INTRODUCTION

One of the fundamental problems in clustering is its validation. This issue has been
addressed by many researchers in different ways, depending on the methods they use to solve the
clustering problem. Nevertheless, the existing approaches are all based on the introduction of a
validity function which is intended to measure the validity of the clustering obtained by the
different methods.

In the case of hierarchical methods, the validity function measures the compactness of the
obtained partition. The methods based on graph theory lead, for example, to measures of
connectivity, length of the chains, etc. The methods based on objective functions usually use the
objective function itself as a validity function. Other well-known measures in the setting of fuzzy
clustering are the "degree of separability", the "partition coefficient" and the "classification
entropy" (Bezdek, 1981). Such a variety of method-dependent and "ad hoc" measures of validity
suggests that it is very difficult to obtain more general solutions to the validity problem. But
before addressing the validity aspect, we think that it is necessary to define what a cluster is;
only after that does it make sense to ask how good a cluster is.

In our work we first address this issue and give a new definition of fuzzy r-cluster that
extends Ruspini's definition (Ruspini, 1982). Our definition is based on the new concept of
indistinguishability relation (Trillas, 1982), which includes as particular cases the concepts of
similarity relation of Zadeh (Zadeh, 1971), probabilistic relation of Menger (Menger, 1951) and
likeness relation of Ruspini (Ruspini, 1982).

The main difficulty with the clustering process is that the data and the obtained partition
are "separated" by the algorithm that built the partition. Consequently, it is generally impossible
to gain insight into the "structure" of the data based only on the information available from
observing the obtained partition, and the existing methods do not, in general, look at the data but
only at the partition.

The approach presented here is heavily based on the data themselves. First we construct an
indistinguishability relation on a data set (that includes the prototypes of the clusters) based
on the degree of membership of each element of the data set to the different clusters. Then we
measure the validity by comparing (through a distance measure) the degrees of membership of the
data to the clusters with the degrees of indistinguishability between the data and the prototypes
of each cluster. The basic idea is that the smaller the difference between the degrees of membership
and the degrees of indistinguishability, the better the clustering. The intuitive reason can be
expressed as follows: the degrees of membership of the elements to a cluster depend only on that
cluster, but the degrees of indistinguishability between the elements and the prototypes of the
clusters depend on the degrees of membership of these elements and these prototypes with respect
to all the clusters. Therefore, if these degrees are the same, the membership functions constitute
a fuzzy cluster coverage (the counterpart, for indistinguishability relations, of partitions), and
according to our definition the clustering obtained is valid if the membership functions form a
fuzzy cluster coverage. We have implemented an algorithm that, given the membership functions,
measures the clustering validity, and we give the results obtained with some examples.

We also suggest that the validity measure obtained can be used to reconsider initial decisions
about the choice of the prototypes, the number of clusters, etc. Moreover, since, as we shall see,
we obtain as a "side effect" information about the logical and metric properties of the data, we
could use this information to build a geometrical representation of the data (factor analysis,
principal components, etc.). We start by giving some definitions and theorems whose proofs are
omitted for reasons of space (see Valverde, 1983).

2. ON INDISTINGUISHABILITY RELATIONS

2.1. A general definition of indistinguishability relations and their use in different fields can be
found in (Trillas, 1982; Trillas and Valverde, 1983). Here we will be mainly concerned with a
special kind of these relations: the F-indistinguishability relations.

Definition 1. Given a non-empty set X, a map E from X×X into [0,1] is called an
F-indistinguishability relation if the following properties hold for any x, y and z in X:

i) $E(x,x) = 1$,

ii) $E(x,y) = E(y,x)$, and

iii) $F(E(x,y), E(y,z)) \le E(x,z)$.

That is, F-indistinguishability relations are simply fuzzy binary relations which are reflexive,
symmetric and F-transitive, F being a continuous t-norm. Thus, similarity relations (Zadeh, 1971)
are F-indistinguishability relations with $F(x,y) = \min(x,y)$; the same applies to probabilistic
relations of indistinguishability (Menger, 1951) with $F(x,y) = x \cdot y$ and to likeness relations
(Ruspini, 1982) with $F(x,y) = \max(x+y-1, 0)$.
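As an illustration of Definition 1, the following minimal sketch checks the three properties on a finite relation stored as a matrix. The function names and the 3×3 matrix `E` are hypothetical and chosen only for illustration; they do not come from the paper.

```python
from itertools import product

# The three continuous t-norms named in the text.
def t_min(a, b):
    return min(a, b)

def t_prod(a, b):
    return a * b

def t_luk(a, b):
    return max(a + b - 1.0, 0.0)

def is_f_indistinguishability(E, F, tol=1e-9):
    """Check reflexivity, symmetry and F-transitivity of a square matrix E."""
    idx = range(len(E))
    reflexive = all(abs(E[i][i] - 1.0) <= tol for i in idx)
    symmetric = all(abs(E[i][j] - E[j][i]) <= tol for i, j in product(idx, idx))
    transitive = all(
        F(E[i][j], E[j][k]) <= E[i][k] + tol
        for i, j, k in product(idx, idx, idx)
    )
    return reflexive and symmetric and transitive

# Hypothetical relation on three points (illustration only).
E = [
    [1.0, 0.8, 0.5],
    [0.8, 1.0, 0.6],
    [0.5, 0.6, 1.0],
]

for name, F in [("Min", t_min), ("product", t_prod), ("Lukasiewicz", t_luk)]:
    print(name, is_f_indistinguishability(E, F))
```

For this particular matrix the check fails for Min, because Min(0.8, 0.6) exceeds 0.5, but it holds for the two smaller t-norms. This reflects the ordering of the three t-norms: transitivity with respect to a larger t-norm is the stronger requirement.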

It is well known that likeness relations were introduced in the framework of fuzzy clustering
and, among other considerations, the rationale behind their definition is the following fact: if E
is a likeness relation on X, then $d(x,y) = 1 - E(x,y)$ is a normalized pseudo-metric on X; that
is, such a relation is given by (and gives) a distance between the elements of X. In this way the
notion of distance, which has long been used in many contexts as a measure of similarity, falls
within the scope of some F-indistinguishability relations. Moreover, we have

Theorem 1. Let d be a pseudo-distance on a set X. Then

a) $L_d(x,y) = 1/(1 + d(x,y))$ is a likeness relation on X, and vice versa.

b) $P_d(x,y) = \exp(-d(x,y))$ is a probabilistic relation on X, and vice versa.
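The direct part of Theorem 1 can be checked numerically on a small example. The sketch below is only an illustration: the sample points and the choice of the absolute-value distance on the real line are assumptions, not data from the paper.

```python
import math
from itertools import product

# Hypothetical points on the real line; d is the usual absolute-value distance.
points = [0.0, 0.4, 1.5, 3.0]

def d(x, y):
    return abs(x - y)

def likeness(x, y):
    # L_d(x, y) = 1 / (1 + d(x, y))
    return 1.0 / (1.0 + d(x, y))

def probabilistic(x, y):
    # P_d(x, y) = exp(-d(x, y))
    return math.exp(-d(x, y))

def transitive(E, F, tol=1e-12):
    """F-transitivity of E over all triples of sample points."""
    return all(F(E(x, y), E(y, z)) <= E(x, z) + tol
               for x, y, z in product(points, repeat=3))

# L_d should be transitive for Max(a + b - 1, 0), P_d for the product.
luk_ok = transitive(likeness, lambda a, b: max(a + b - 1.0, 0.0))
prod_ok = transitive(probabilistic, lambda a, b: a * b)
print(luk_ok, prod_ok)
```

The product case follows directly from the triangle inequality, since exp(-d(x,y)) · exp(-d(y,z)) = exp(-(d(x,y) + d(y,z))) ≤ exp(-d(x,z)).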

F-indistinguishability relations with respect to t-norms other than those corresponding to
likeness and probabilistic relations do not, in general, give a pseudo-metric but a
G-pseudometric:

Definition 2. Given a continuous t-conorm G, a map m from X×X into [0,1] is called a
G-pseudometric if the following properties hold for any x, y and z in X:

i) $m(x,x) = 0$,

ii) $m(x,y) = m(y,x)$, and

iii) $G(m(x,y), m(y,z)) \ge m(x,z)$.

It can easily be seen that, for any continuous t-conorm G,

$m_G(a,b) = \hat{G}(f(\max(a,b)) \mid f(\min(a,b)))$

is a G-pseudometric on [0,1], where f is a continuous and strictly decreasing bijection from [0,1]
into itself, and $\hat{G}$ is the quasi-inverse of G, i.e.
$\hat{G}(x \mid y) = \inf \{ a \in [0,1] \mid G(a,x) \ge y \}$.
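The quasi-inverse of a t-conorm and the associated map $m_G$ can be approximated numerically. The sketch below is an illustration under stated assumptions: it uses bisection, which relies on G being continuous and non-decreasing in its first argument, and it takes f(t) = 1 - t as the order-reversing bijection. The function names are hypothetical.

```python
def quasi_inverse(G, x, y, iters=60):
    """Approximate Inf{a in [0,1] : G(a, x) >= y} by bisection.

    Assumes G is continuous and non-decreasing in its first argument,
    as any continuous t-conorm is.
    """
    if G(0.0, x) >= y:
        return 0.0
    lo, hi = 0.0, 1.0  # invariant: G(lo, x) < y <= G(hi, x)
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        if G(mid, x) >= y:
            hi = mid
        else:
            lo = mid
    return hi

def m_G(G, a, b, f=lambda t: 1.0 - t):
    """m_G(a, b) = quasi-inverse of G evaluated at (f(Max(a, b)) | f(Min(a, b)))."""
    return quasi_inverse(G, f(max(a, b)), f(min(a, b)))

def g_luk(a, b):
    return min(a + b, 1.0)

def g_max(a, b):
    return max(a, b)

print(m_G(g_luk, 0.9, 0.5))  # close to |0.9 - 0.5|
print(m_G(g_max, 0.9, 0.5))  # close to 1 - Min(0.9, 0.5)
print(m_G(g_max, 0.3, 0.3))  # 0.0 for equal arguments
```

With G(a,b) = Min(a+b, 1) the result agrees with the absolute difference |a - b| up to the bisection tolerance. With G = Max it is f(Min(a,b)) whenever the two arguments differ.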

The links between G-pseudometrics and F-indistinguishability relations are described in the
following theorem:

Theorem 2. Let m be a G-pseudometric on a set X. Then, for any continuous and strictly
decreasing bijection f from [0,1] into itself, $E(x,y) = f(m(x,y))$ is an F-indistinguishability
relation with respect to $F(x,y) = f^{-1}(G(f(x), f(y)))$, and vice versa.

That is, any F-indistinguishability relation on a set X is determined, up to an order-reversing
bijection on the unit interval, by a G-pseudometric, and vice versa.

2.2. In (Ruspini, 1982) the concept of fuzzy r-cluster is introduced in the following way: given a
reflexive and symmetric fuzzy relation r on X, a fuzzy subset g of X is called a fuzzy r-cluster if
the following properties hold for any x and y in X:

a) If $g(x) = 1$ then $g(y) = r(x,y)$,

b) $|g(x) - g(y)| \le 1 - r(x,y)$.

It is shown that the family of similarity classes given by any likeness relation r (i.e. the
fuzzy subsets of X defined by $g_x(y) = r(x,y)$) satisfies the following two properties:

i) $\sup_{x \in X} g_x(y) = 1$ and $\sup_{y \in X} g_x(y) = 1$ (i.e. $\{g_x\}_{x \in X}$ is a
fuzzy coverage of X).

ii) For any $x \in X$, $g_x$ is a fuzzy r-cluster.

Conversely, if for a given reflexive and symmetric relation r there exists a fuzzy coverage of X
satisfying the above property, then r must be a likeness relation.
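The two conditions of Ruspini's definition are easy to test on a finite set. The following sketch is an illustration only: the three points and the likeness relation r(x,y) = 1 - |p_x - p_y| built from them are assumptions chosen for the example.

```python
def is_r_cluster(g, r, tol=1e-9):
    """Check Ruspini's conditions a) and b) for a fuzzy subset g and relation r."""
    n = len(g)
    for x in range(n):
        for y in range(n):
            # a) if g(x) = 1 then g(y) = r(x, y)
            if abs(g[x] - 1.0) <= tol and abs(g[y] - r[x][y]) > tol:
                return False
            # b) |g(x) - g(y)| <= 1 - r(x, y)
            if abs(g[x] - g[y]) > 1.0 - r[x][y] + tol:
                return False
    return True

# Hypothetical points in [0, 1]; r is the likeness relation 1 - |p - q|.
pts = [0.0, 0.3, 0.7]
r = [[1.0 - abs(p - q) for q in pts] for p in pts]

# Each similarity class g_x = r(x, .) should be a fuzzy r-cluster.
classes_ok = all(is_r_cluster(r[x], r) for x in range(len(pts)))

# A fuzzy subset that violates condition a).
bad = [1.0, 0.2, 0.3]
print(classes_ok, is_r_cluster(bad, r))
```

Condition b) for the similarity classes reduces to the triangle inequality of the distance 1 - r, which is why it holds for any likeness relation.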

Thus, fuzzy r-clusters are the counterpart, for likeness relations, of classical clusters
(equivalence classes with respect to an equivalence relation). In order to extend these results to
any F-indistinguishability relation, note that Ruspini's definition involves a particular
metric on the unit interval, namely the restriction of the Euclidean distance. In fact, this
definition may be viewed as a generalization of the definition of classical clusters because, as is
easy to show, classical clusters can be characterized by means of

a) If $\mu_A(x) = 1$ then $\mu_A(y) = \mu_R(x,y)$,

b) $d_0(\mu_A(x), \mu_A(y)) \le 1 - \mu_R(x,y)$,

where $\mu_A$ is the characteristic function of the set A and $d_0$ is the discrete distance on the
two-point set {0,1}.

Thus, by taking different metrics, different kinds of fuzzy r-clusters and, therefore, different
kinds of F-indistinguishability relations r will be obtained:

Definition 3. Let r be a reflexive and symmetric fuzzy relation on X. For a given continuous
t-conorm G, a fuzzy subset g of X will be termed a fuzzy r-cluster with respect to
$m_G(a,b) = \hat{G}(f(\max(a,b)) \mid f(\min(a,b)))$ if the following properties hold for any x
and y in X:

a) If $g(x) = 1$ then $g(y) = r(x,y)$,

b) $m_G(g(x), g(y)) \le f(r(x,y))$.

A fuzzy r-cluster coverage will be a fuzzy coverage of X such that each of its elements is a
fuzzy r-cluster.

This definition extends Ruspini's definition; that is, we have:

Theorem 3. For a given F-indistinguishability relation r on X, the family
$C_r = \{ g_x \in [0,1]^X \mid x \in X \}$, where $g_x(y) = r(x,y)$, is a fuzzy r-cluster
coverage of X with respect to $m_G$, where $G(x,y) = f^{-1}(F(f(x), f(y)))$.

Conversely, we have

Theorem 4. Let C be a fuzzy coverage of X and r a reflexive and symmetric fuzzy relation on X.
If every element of C is a fuzzy r-cluster with respect to $m_G$, then r is an
F-indistinguishability relation, where $F(x,y) = f(G(f^{-1}(x), f^{-1}(y)))$.

EXAMPLES

(E1) If $G(a,b) = \min(a+b, 1)$ and $f(a) = 1 - a$, then

$m_G(a,b) = |a - b|$

and r is a likeness relation.

(E2) If $G(a,b) = \max(a,b)$, then

$m_G(a,b) = f(\min(a,b))$ if $a \ne b$, and $m_G(a,b) = 0$ if $a = b$,

and r is a similarity relation.

(E3) If $G(a,b) = a + b - ab$ and $f(a) = 1 - a$, then

and r is a probabilistic relation.

2.3. Our approach is heavily based on the representation theorems for F-indistinguishability
operators (Ovchinnikov, 1982; Valverde, 1982), which provide a way to construct
F-indistinguishability operators. The following is the second author's formulation of that result:

Theorem 5 (Representation theorem). Let r be a reflexive fuzzy relation on X and let F be a
continuous t-norm. Then r is an F-indistinguishability relation on X if, and only if, there exists a
family of fuzzy subsets of X, $(h_j)_{j \in J}$, such that

$r(x,y) = \inf_{j \in J} \hat{F}(\max(h_j(x), h_j(y)) \mid \min(h_j(x), h_j(y)))$

where $\hat{F}$ is the quasi-inverse of F, i.e.
$\hat{F}(x \mid y) = \sup \{ a \in [0,1] \mid F(a,x) \le y \}$.

Consequently, given a continuous t-norm F, we can associate an F-indistinguishability relation
with any family of fuzzy subsets of X and, therefore, a fuzzy r-cluster coverage with respect
to some G-pseudometric of X is obtained.
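The construction given by the representation theorem can be sketched as follows, assuming the relation is generated as the infimum over j of the quasi-inverse of F evaluated at (Max(h_j(x), h_j(y)) | Min(h_j(x), h_j(y))). The closed forms of the three quasi-inverses are standard; the function names are hypothetical, and the membership values are those of Data Sample #2 from the examples of Section 3.

```python
# Closed-form quasi-inverses Sup{a in [0,1] : F(a, x) <= y} of the three t-norms.
def quasi_inverse_min(x, y):
    return 1.0 if x <= y else y

def quasi_inverse_prod(x, y):
    return 1.0 if x <= y else y / x

def quasi_inverse_luk(x, y):
    return min(1.0 - x + y, 1.0)

def generated_relation(h, quasi_inverse):
    """Relation generated by the family h, where h[j][i] = h_j(x_i)."""
    n = len(h[0])
    return [
        [min(quasi_inverse(max(hj[x], hj[y]), min(hj[x], hj[y])) for hj in h)
         for y in range(n)]
        for x in range(n)
    ]

# Membership values of Data Sample #2 (elements x1..x8).
h = [
    [1.0, 0.0, 0.8, 0.9, 0.5, 0.4, 0.1, 0.5],  # h_1
    [0.0, 1.0, 0.1, 0.2, 0.9, 0.9, 0.8, 0.7],  # h_2
]

r_min = generated_relation(h, quasi_inverse_min)
r_prod = generated_relation(h, quasi_inverse_prod)
r_luk = generated_relation(h, quasi_inverse_luk)

print(round(r_luk[0][2], 2))   # r(x1, x3) for Max(a + b - 1, 0)
print(round(r_prod[2][3], 2))  # r(x3, x4) for the product
print(round(r_min[4][7], 2))   # r(x5, x8) for Min
```

The three printed entries agree with the corresponding entries of the DS #2 tables in the examples (0.8, 0.5 and 0.7 respectively).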


EXAMPLES

(E4) If $F(a,b) = \min(a,b)$, then the F-indistinguishability relation associated to
$(h_j)_{j \in J}$, $h_j \in [0,1]^X$, is given by

$r(x,y) = \inf_{j \in J_{xy}} \min(h_j(x), h_j(y))$

where $J_{xy} = \{ j \in J \mid h_j(x) \ne h_j(y) \}$, and $r(x,y) = 1$ if $J_{xy}$ is empty.

(E5) If $F(a,b) = a \cdot b$, then

$r(x,y) = \inf_{j \in J} \min(h_j(x), h_j(y)) / \max(h_j(x), h_j(y))$

(E6) If $F(a,b) = \max(a+b-1, 0)$, then

$r(x,y) = \inf_{j \in J} (1 - |h_j(x) - h_j(y)|)$

We remark that, since F-indistinguishability relations and G-pseudometrics are dual concepts,
Theorem 5 also provides a representation for any G-pseudometric.


3.  CLUSTER  VALIDITY 

The previous theorem can be rephrased as follows: to any family of fuzzy subsets (membership
functions) of X and for any continuous t-norm F, we can associate a fuzzy cluster coverage
of X and, by duality, a G-pseudometric, i.e. a distance between the elements of X.

The basic principle underlying the cluster validity is given in the following

Proposition. If the membership functions $\{h_j\}_{j \in J}$ constitute a fuzzy cluster coverage,
then there exists a t-norm F such that the degree of membership $h_j(x)$ of an element x with
respect to the cluster j is the same as the degree of indistinguishability (with respect to the
indistinguishability operator generated by F) between this element x and the prototype of the
cluster j.

In general the family $\{h_j\}_{j \in J}$ of membership functions does not constitute a fuzzy
coverage and, therefore, the degrees of membership and the degrees of indistinguishability will
not coincide. We then state the following

Principle. The best t-norm to be used to generate the fuzzy coverage associated with the family
$\{h_j\}_{j \in J}$ will be that which corresponds to the smallest distance (using the
corresponding $m_G$) between the $\{h_j\}_{j \in J}$ and the corresponding $\{g_j\}_{j \in J}$
in the indistinguishability relation.

Remark. We detect at the same time the best logic (best t-norm, best t-conorm) and the best
topology ($m_G$) underlying the structure of the data set X.

ALGORITHM. Given a family $\{F_\lambda\}_{\lambda \in \Gamma}$ of t-norms such that
$F_\lambda \ge F_{\lambda'}$ if $\lambda \le \lambda'$, and the family $\{h_j\}_{j \in J}$ of
membership functions, do the following for each t-norm:

(1) Construct the corresponding $F_\lambda$-indistinguishability relation.

(2) For each cluster j, calculate the local distance

$d_j^\lambda = \frac{1}{|X|} \sum_{x \in X} m_{G_\lambda}(h_j(x), g_j(x))$

where |X| is the cardinal of the data set X, $g_j(x)$ is the degree of indistinguishability
between x and the prototype of the cluster j, and $m_{G_\lambda}$ is the distance
associated with the t-norm $F_\lambda$ (for a fixed order-reversing bijection f from the
unit interval into itself).

(3) Calculate the global distance

$D^\lambda = \frac{1}{|J|} \sum_{j \in J} d_j^\lambda$

where |J| is the number of classes.

(4) Select the set of $\lambda$'s such that $D^\lambda$ is minimum and, among these,
the biggest one. The reason for choosing the biggest $\lambda$ is the following: if
$\lambda \le \lambda'$ then $F_\lambda \ge F_{\lambda'}$, therefore
$G_\lambda \le G_{\lambda'}$, and this implies that $m_{G_\lambda} \ge m_{G_{\lambda'}}$
(as has been shown in (Valverde, 1983)). That is, a fuzzy coverage with respect to
$F_\lambda$ is also a fuzzy coverage with respect to $F_{\lambda'}$, but the values of the
distances measured using $m_{G_\lambda}$ are bigger; in other words, we say that
$m_{G_\lambda}$ has a higher resolution power than $m_{G_{\lambda'}}$.
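The four steps can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the local distance is taken as the mean of $m_G(h_j(x), g_j(x))$ over the data set, the global distance as the mean of the local distances, and the elements x1 and x2 (the only ones with full membership) are assumed to act as the prototypes. The memberships are those of Data Sample #2, and the family is restricted to the Min and Lukasiewicz t-norms with f(t) = 1 - t; all identifiers are hypothetical.

```python
# Memberships of Data Sample #2 (elements x1..x8).
H = [
    [1.0, 0.0, 0.8, 0.9, 0.5, 0.4, 0.1, 0.5],  # h_1
    [0.0, 1.0, 0.1, 0.2, 0.9, 0.9, 0.8, 0.7],  # h_2
]
PROTOTYPES = [0, 1]  # assumption: x1 and x2 are the prototypes of clusters 1 and 2

# Each entry: (lambda, name, quasi-inverse of F, distance m_G for f(t) = 1 - t).
# The family is ordered so that a smaller lambda means a larger t-norm.
FAMILY = [
    (1, "Min",
     lambda x, y: 1.0 if x <= y else y,
     lambda a, b: 0.0 if a == b else 1.0 - min(a, b)),
    (2, "Lukasiewicz",
     lambda x, y: min(1.0 - x + y, 1.0),
     lambda a, b: abs(a - b)),
]

def relation(h, quasi_inverse):
    """Step 1: indistinguishability relation generated by the memberships."""
    n = len(h[0])
    return [[min(quasi_inverse(max(hj[x], hj[y]), min(hj[x], hj[y])) for hj in h)
             for y in range(n)] for x in range(n)]

def validity(h, prototypes, quasi_inverse, m):
    """Steps 2 and 3: local distances per cluster and their mean."""
    r = relation(h, quasi_inverse)
    n = len(h[0])
    local = [sum(m(hj[x], r[p][x]) for x in range(n)) / n
             for hj, p in zip(h, prototypes)]
    return local, sum(local) / len(local)

results = {lam: validity(H, PROTOTYPES, qi, m) for lam, _, qi, m in FAMILY}

# Step 4: among the lambdas with minimal global distance, keep the biggest.
best_D = min(D for _, D in results.values())
best_lambda = max(lam for lam, (_, D) in results.items() if abs(D - best_D) < 1e-9)

for lam, name, _, _ in FAMILY:
    local, D = results[lam]
    print(name, [round(v, 3) for v in local], round(D, 3))
print("selected lambda:", best_lambda)
```

Under these assumptions the sketch gives global distances of 0.75 for Min and 0.125 for the Lukasiewicz t-norm, which are the values reported for DS #2 in the examples, so the Lukasiewicz t-norm is selected.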

EXAMPLES

We have tried our approach with several examples involving two clusters, and we have
considered three different t-norms:

1) $F(x,y) = \min(x,y)$
2) $F(x,y) = x \cdot y$
3) $F(x,y) = \max(x+y-1, 0)$.

The order-reversing bijection from the unit interval into itself that we have used is
$f(x) = 1 - x$, so the corresponding t-conorms are:

1') $G(x,y) = \max(x,y)$

2') $G(x,y) = x + y - x \cdot y$

3') $G(x,y) = \min(x+y, 1)$.

In almost all cases, the best t-norm is the last one of the previous three, which corresponds
precisely to the Lukasiewicz Aleph-1 logic; the corresponding $m_G$ is the restriction
to the unit interval of the Euclidean distance.
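The correspondence between the three t-norms and their t-conorms can be checked numerically through the duality $G(x,y) = f^{-1}(F(f(x), f(y)))$ with f(x) = 1 - x. The sketch below is only an illustration; the helper name `dual_conorm` and the evaluation grid are assumptions.

```python
def dual_conorm(F, f=lambda t: 1.0 - t, f_inv=lambda t: 1.0 - t):
    """Return G(x, y) = f^{-1}(F(f(x), f(y))) for an order-reversing bijection f."""
    return lambda x, y: f_inv(F(f(x), f(y)))

t_norms = {
    "min": lambda x, y: min(x, y),
    "prod": lambda x, y: x * y,
    "luk": lambda x, y: max(x + y - 1.0, 0.0),
}

# Closed forms of the t-conorms listed in the text.
expected = {
    "min": lambda x, y: max(x, y),
    "prod": lambda x, y: x + y - x * y,
    "luk": lambda x, y: min(x + y, 1.0),
}

grid = [i / 10 for i in range(11)]
max_gap = max(
    abs(dual_conorm(t_norms[k])(x, y) - expected[k](x, y))
    for k in t_norms for x in grid for y in grid
)
print(max_gap)  # only floating-point rounding error remains
```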


DATA SAMPLE #1

        h1      h2
x1      0.10    0.60
x2      0.60    0.10
x3      0.10    0.90
x4      0.30    0.40
x5      0.30    0.90
x6      0.10    0.60
x7      0.60    0.10
x8      0.40    0.20
x9      0.30    0.80

DS #1, $F(x,y) = \min(x,y)$

            g11   g21
r(xi,xj)    x1    x2    x3    x4    x5    x6    x7    x8    x9
x1          1     0.1   0.1   0.1   0.1   1     0.1   0.1   0.1
x2          0.1   1     0.1   0.1   0.1   0.1   0.1   0.1   0.1
x3          0.1   0.1   1     0.1   0.1   0.1   0.1   0.1   0.1
x4          0.1   0.1   0.1   1     0.4   0.1   0.1   0.2   0.4
x5          0.1   0.1   0.1   0.4   1     0.1   0.1   0.2   0.8
x6          1     0.1   0.1   0.1   0.1   1     0.1   0.1   0.1
x7          0.1   0.1   0.1   0.1   0.1   0.1   1     0.1   0.1
x8          0.1   0.1   0.1   0.2   0.2   0.1   0.1   1     0.2
x9          0.1   0.1   0.1   0.4   0.8   0.1   0.1   0.2   1

Cluster #1: $d_1^\lambda = 0.90$
Cluster #2: $d_2^\lambda = 0.90$
$D^\lambda = 0.90$

DS #1, $F(x,y) = x \cdot y$

            g11   g21
r(xi,xj)    x1    x2    x3    x4    x5    x6    x7    x8    x9
x1          1     0.17  0.67  0.33  0.33  1     0.17  0.25  0.33
x2          0.17  1     0.11  0.25  0.11  0.17  1     0.5   0.13
x3          0.67  0.11  1     0.33  0.33  0.67  0.11  0.22  0.33
x4          0.33  0.25  0.33  1     0.44  0.33  0.25  0.5   0.5
x5          0.33  0.11  0.33  0.44  1     0.33  0.11  0.25  0.89
x6          1     0.17  0.67  0.33  0.33  1     0.17  0.25  0.33
x7          0.17  1     0.11  0.25  0.11  0.17  1     0.5   0.13
x8          0.25  0.5   0.22  0.5   0.25  0.25  0.5   1     0.25
x9          0.33  0.13  0.33  0.5   0.89  0.33  0.13  0.25  1

Cluster #1: $d_1^\lambda = 0.446$
Cluster #2: $d_2^\lambda = 0.685$
$D^\lambda = 0.565$


DS #1, $F(x,y) = \max(x+y-1, 0)$

            g11   g21
r(xi,xj)    x1    x2    x3    x4    x5    x6    x7    x8    x9
x1          1     0.5   0.7   0.8   0.7   1     0.5   0.6   0.8
x2          0.5   1     0.2   0.7   0.2   0.5   1     0.8   0.3
x3          0.7   0.2   1     0.5   0.8   0.7   0.2   0.3   0.8
x4          0.8   0.7   0.5   1     0.5   0.8   0.7   0.8   0.6
x5          0.7   0.2   0.8   0.5   1     0.7   0.2   0.3   0.9
x6          1     0.5   0.7   0.8   0.7   1     0.5   0.6   0.8
x7          0.5   1     0.2   0.7   0.2   0.5   1     0.8   0.3
x8          0.6   0.8   0.3   0.8   0.3   0.6   0.8   1     0.4
x9          0.8   0.3   0.8   0.6   0.9   0.8   0.3   0.4   1

Cluster #1: $d_1^\lambda = 0.467$
Cluster #2: $d_2^\lambda = 0.533$
$D^\lambda = 0.500$

DATA SAMPLE #2

        h1      h2
x1      1       0
x2      0       1
x3      0.80    0.10
x4      0.90    0.20
x5      0.50    0.90
x6      0.40    0.90
x7      0.10    0.80
x8      0.50    0.70


DS #2, $F(x,y) = \min(x,y)$

            g11   g21
r(xi,xj)    x1    x2    x3    x4    x5    x6    x7    x8
x1          1     0     0     0     0     0     0     0
x2          0     1     0     0     0     0     0     0
x3          0     0     1     0.1   0.1   0.1   0.1   0.1
x4          0     0     0.1   1     0.2   0.2   0.1   0.2
x5          0     0     0.1   0.2   1     0.4   0.1   0.7
x6          0     0     0.1   0.2   0.4   1     0.1   0.4
x7          0     0     0.1   0.1   0.1   0.1   1     0.1
x8          0     0     0.1   0.2   0.7   0.4   0.1   1

Cluster #1: $d_1^\lambda = 0.75$
Cluster #2: $d_2^\lambda = 0.75$
$D^\lambda = 0.75$


DS #2, $F(x,y) = x \cdot y$

            g11   g21
r(xi,xj)    x1    x2    x3    x4    x5    x6    x7    x8
x1          1     0     0     0     0     0     0     0
x2          0     1     0     0     0     0     0     0
x3          0     0     1     0.5   0.11  0.11  0.13  0.14
x4          0     0     0.5   1     0.22  0.22  0.13  0.29
x5          0     0     0.11  0.22  1     0.8   0.2   0.78
x6          0     0     0.11  0.22  0.8   1     0.25  0.78
x7          0     0     0.13  0.13  0.2   0.25  1     0.2
x8          0     0     0.14  0.29  0.78  0.78  0.2   1

Cluster #1: $d_1^\lambda = 0.40$
Cluster #2: $d_2^\lambda = 0.45$
$D^\lambda = 0.425$

DS #2, $F(x,y) = \max(x+y-1, 0)$

            g11   g21
r(xi,xj)    x1    x2    x3    x4    x5    x6    x7    x8
x1          1     0     0.8   0.8   0.1   0.1   0.1   0.3
x2          0     1     0.1   0.1   0.5   0.6   0.8   0.5
x3          0.8   0.1   1     0.9   0.2   0.2   0.3   0.4
x4          0.8   0.1   0.9   1     0.3   0.3   0.2   0.5
x5          0.1   0.5   0.2   0.3   1     0.9   0.6   0.8
x6          0.1   0.6   0.2   0.3   0.9   1     0.7   0.8
x7          0.1   0.8   0.3   0.2   0.6   0.7   1     0.6
x8          0.3   0.5   0.4   0.5   0.8   0.8   0.6   1

Cluster #1: $d_1^\lambda = 0.125$
Cluster #2: $d_2^\lambda = 0.125$
$D^\lambda = 0.125$

Concluding remarks.

We have generalized some previous results of Ruspini and others in fuzzy clustering, using
the new concept of indistinguishability relation based on the concept of t-norm, and we have also
studied its metrical properties through the dual concept of t-conorm, which leads to
G-pseudometrics. From the concept of G-pseudometric we have defined fuzzy r-clusters and fuzzy
cluster coverages. Finally, we have proposed a measure of cluster validity based on the concept of
fuzzy coverage.

It is important to notice that the process of measuring the validity is carried out before any
decision concerning the assignment of elements to the clusters. Therefore we can postpone the
decision step until we have an acceptable cluster validity. That is, we have a sort of closed loop,
in the sense that a bad measure of validity obliges the user to reconsider previous hypotheses such
as the number of clusters, the values of the prototypes, etc. We are currently studying
this "closed loop" aspect in the setting of different classification algorithms.